106 research outputs found

    Explainable artificial intelligence through graph theory by generalized social network analysis-based classifier

    Get PDF
    We propose a new type of supervised visual machine learning classifier, GSNAc, based on graph theory and social network analysis techniques. In a previous study, we employed social network analysis techniques and introduced a novel classification model (called Social Network Analysis-based Classifier-SNAc) which efficiently works with time-series numerical datasets. In this study, we have extended SNAc to work with any type of tabular data by showing its classification efficiency on a broader collection of datasets that may contain numerical and categorical features. This version of GSNAc simply works by transforming traditional tabular data into a network where samples of the tabular dataset are represented as nodes and similarities between the samples are reflected as edges connecting the corresponding nodes. The raw network graph is further simplified and enriched by its edge space to extract a visualizable 'graph classifier model-GCM'. The concept of the GSNAc classification model relies on the study of node similarities over network graphs. In the prediction step, the GSNAc model maps test nodes into GCM, and evaluates their average similarity to classes by employing vectorial and topological metrics. The novel side of this research lies in transforming multidimensional data into a 2D visualizable domain. This is realized by converting a conventional dataset into a network of 'samples' and predicting classes after a careful and detailed network analysis. We exhibit the classification performance of GSNAc as an effective classifier by comparing it with several well-established machine learning classifiers using some popular benchmark datasets. GSNAc has demonstrated superior or comparable performance compared to other classifiers. Additionally, it introduces a visually comprehensible process for the benefit of end-users. As a result, the spin-off contribution of GSNAc lies in the interpretability of the prediction task since the process is human-comprehensible; and it is highly visual

    The role of machine learning in identifying students at-risk and minimizing failure

    Get PDF
    Education is very important for students' future success. The performance of students can be supported by the extra assignments and projects given by the instructors for students with low performance. However, a major problem is that students at-risk cannot be identified early. This situation is being investigated by various researchers using Machine Learning techniques. Machine learning is used in a variety of areas and has also begun to be used to identify students at-risk early and to provide support by instructors. This research paper discusses the performance results found using Machine learning algorithms to identify at-risk students and minimize student failure. The main purpose of this project is to create a hybrid model using the ensemble stacking method and to predict at-risk students using this model. We used machine learning algorithms such as Naive Bayes, Random Forest, Decision Tree, K-Nearest Neighbors, Support Vector Machine, AdaBoost Classifier and Logistic Regression in this project. The performance of each machine learning algorithm presented in the project was measured with various metrics. Thus, the hybrid model by combining algorithms that give the best prediction results is presented in this study. The data set containing the demographic and academic information of the students was used to train and test the model. In addition, a web application developed for the effective use of the hybrid model and for obtaining prediction results is presented in the report. In the proposed method, it has been realized that stratified k-fold cross validation and hyperparameter optimization techniques increased the performance of the models. The hybrid ensemble model was tested with a combination of two different datasets to understand the importance of the data features. In first combination, the accuracy of the hybrid model was obtained as 94.8% by using both demographic and academic data. In the second combination, when only academic data was used, the accuracy of the hybrid model increased to 98.4%. This study focuses on predicting the performance of at-risk students early. Thus, teachers will be able to provide extra assistance to students with low performance

    A survey of machine learning-based methods for COVID-19 medical image analysis

    Get PDF
    The ongoing COVID-19 pandemic caused by the SARS-CoV-2 virus has already resulted in 6.6 million deaths with more than 637 million people infected after only 30 months since the first occurrences of the disease in December 2019. Hence, rapid and accurate detection and diagnosis of the disease is the first priority all over the world. Researchers have been working on various methods for COVID-19 detection and as the disease infects lungs, lung image analysis has become a popular research area for detecting the presence of the disease. Medical images from chest X-rays (CXR), computed tomography (CT) images, and lung ultrasound images have been used by automated image analysis systems in artificial intelligence (AI)- and machine learning (ML)-based approaches. Various existing and novel ML, deep learning (DL), transfer learning (TL), and hybrid models have been applied for detecting and classifying COVID-19, segmentation of infected regions, assessing the severity, and tracking patient progress from medical images of COVID-19 patients. In this paper, a comprehensive review of some recent approaches on COVID-19-based image analyses is provided surveying the contributions of existing research efforts, the available image datasets, and the performance metrics used in recent works. The challenges and future research scopes to address the progress of the fight against COVID-19 from the AI perspective are also discussed. The main objective of this paper is therefore to provide a summary of the research works done in COVID detection and analysis from medical image datasets using ML, DL, and TL models by analyzing their novelty and efficiency while mentioning other COVID-19-based review/survey researches to deliver a brief overview on the maximum amount of information on COVID-19-based existing researches. [Figure not available: see fulltext.

    Interactive framework for Covid-19 detection and segmentation with feedback facility for dynamically improved accuracy and trust

    Get PDF
    Due to the severity and speed of spread of the ongoing Covid-19 pandemic, fast but accurate diagnosis of Covid-19 patients has become a crucial task. Achievements in this respect might enlighten future efforts for the containment of other possible pandemics. Researchers from various fields have been trying to provide novel ideas for models or systems to identify Covid-19 patients from different medical and non-medical data. AI-based researchers have also been trying to contribute to this area by mostly providing novel approaches of automated systems using convolutional neural network (CNN) and deep neural network (DNN) for Covid-19 detection and diagnosis. Due to the efficiency of deep learning (DL) and transfer learning (TL) models in classification and segmentation tasks, most of the recent AI-based researches proposed various DL and TL models for Covid-19 detection and infected region segmentation from chest medical images like X-rays or CT images. This paper describes a web-based application framework for Covid-19 lung infection detection and segmentation. The proposed framework is characterized by a feedback mechanism for self learning and tuning. It uses variations of three popular DL models, namely Mask R-CNN, UNet, and U-Net++. The models were trained, evaluated and tested using CT images of Covid patients which were collected from two different sources. The web application provide a simple user friendly interface to process the CT images from various resources using the chosen models, thresholds and other parameters to generate the decisions on detection and segmentation. The models achieve high performance scores for Dice similarity, Jaccard similarity, accuracy, loss, and precision values. The U-Net model outperformed the other models with more than 98% accuracy

    A comparative study of different pre-trained deeplearning models and custom CNN for pancreatic tumor detection

    Get PDF
    Artificial Intelligence and its sub-branches like MachineLearning (ML) and Deep Learning (DL) applications have the potential to have positive effects that can directly affect human life. Medical imaging is briefly making the internal structure of the human body visible with various methods. With deep learning models, cancer detection, which is one of the most lethal diseases in the world, can be made possible with high accuracy. Pancreatic Tumor detection, which is one of the cancer types with the highest fatality rate, is one of the main targets of this project, together with the data set of computed tomography images,which is one of the medical imaging techniques and has an effective structure in Pancreatic Cancer imaging. In the field of image classification, which is a computer vision task, the transfer learning technique, which has gained popularity in recent years, has been applied quite frequently. Using pre-trained models werepreviously trained on a fairly large dataset and using them on medical images is common nowadays. The main objective of this article is to use this method, which is very popular inthe medical imaging field, in the detection of PDAC, one of the deadliest types of pancreatic cancer, and to investigate how it per-forms compared to the custom model created and trained from scratch. The pre-trained models which are used in this project areVGG-16 and ResNet, which are popular Convolutional Neutral Network models, for Pancreatic Tumor Detection task. With the use of these models, early diagnosis of pancreatic cancer, which progresses insidiously and therefore does not spread to neighboring tissues and organs when the treatment process is started, may be possible. Due to the abundance of medical images reviewed by medical professionals, which is one of the main causes for heavy workload of healthcare systems, this applicationcan assist radiologists and other specialists in Pancreatic Tumor detection by providing faster and more accurate method

    The Unification and Assessment of Multi-Objective Clustering Results of Categorical Datasets with H-Confidence Metric

    Get PDF
    Abstract: Multi objective clustering is one focused area of multi objective optimization. Multi objective optimization attracted many researchers in several areas over a decade. Utilizing multi objective clustering mainly considers multiple objectives simultaneously and results with several natural clustering solutions. Obtained result set suggests different point of views for solving the clustering problem. This paper assumes all potential solutions belong to different experts and in overall; ensemble of solutions finally has been utilized for finding the final natural clustering. We have tested on categorical datasets and compared them against single objective clustering result in terms of purity and distance measure of k-modes clustering. Our clustering results have been assessed to find the most natural clustering. Our results get hold of existing classes decided by human experts

    Implementation and performance analysis of maximally containment technique for information integration /

    No full text

    Data on cut-edge for spatial clustering based on proximity graphs

    Get PDF
    Cluster analysis plays a significant role regarding automating such a knowledge discovery process in spatial data mining. A good clustering algorithm supports two essential conditions, namely high intra-cluster similarity and low inter-cluster similarity. Maximized intra-cluster/within-cluster similarity produces low distances between data points inside the same cluster. However, minimized inter-cluster/between-cluster similarity increases the distance between data points in different clusters by furthering them apart from each other. We previously presented a spatial clustering algorithm, abbreviated CutESC (Cut-Edge for Spatial Clustering) with a graph-based approach. The data presented in this article is related to and supportive to the research paper entitled "CutESC: Cutting edge spatial clustering technique based on proximity graphs" (Aksac et al., 2019) [1], where interpretation research data presented here is available. In this article, we share the parametric version of our algorithm named CutESC-P, the best parameter settings for the experiments, the additional analyses and some additional information related to the proposed algorithm (CutESC) in [1]

    A comprehensive approach for validating p53 binding site predictions

    No full text
    8th International Conference on Information Technology (2017 : Amman; Jordan)Predicting the locations of Response Elements (RE) has received considerable attention in the field of gene sequence analysis and bioinformatics. Protein53 (p53) has a prominent role in the cell cycle and cancer prevention; it functions as a transcription factor and binds with p53 REs in the DNA. The identification of p53 response elements enlightens the unknown functions and characteristics of p53 besides the genes containing binding sites. In this work, we have proposed an algorithm for validating the prediction of the possible p53 binding sites in the human genome, by incorporating the recent findings on the p53 REs into our suggested profile hidden Markov model (PHMM). We constructed two PHMMs and the results described in this paper are very promising. In the experiments, we have used the p53 REs data reported by Riley et al. [21]. © 2017 IEEE
    corecore